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As a tutorial to the spatial aspects of Spontaneous Parametric Downconversion (SPDC), 
we present a detailed first-principles derivation of the transverse correlation width of 
photon pairs in degenerate collinear SPDC. This width defines the size of a biphoton 
birth zone, the region where the signal and idler photons are likely to be found when 
conditioning on the position of the destroyed pump photon. Along the way, we dis¬ 
cuss the quantum-optical calculation of the amplitude for the SPDC process, as well 
as its simplified form for nearly collinear degenerate phase matching. Following this, 
we show how this biphoton amplitude can be approximated with a Double-Gaussian 
wavefunction, and give a brief discussion of the measurement statistics (and subsequent 
convenience) of such Double-Gaussian wavefunctions. Next, we use this approximation 
to get a simplified estimation of the transverse correlation width, and compare it to 
more accurate calculations as well as experimental results. We then conclude with a 
discussion of the concept of a biphoton birth zone, using it to develop intuition for the 
tradeoff between the first-order spatial coherence and bipohoton correlations in SPDC. 
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I. INTRODUCTION 

In continuous-variable quantum information, there are 
many experiments using entangled photon pairs gener¬ 
ated by spontaneous parametric downconversion (SPDC) 
1 . In short, SPDC is a -nonlinear optical process 
occurring in birefringent crystals 2 where high energy 
“pump” photons are converted into pairs of low energy 


1 There are many dozens (if not hundreds) of experimental pa¬ 
pers either using or exploring the spatial entanglement between 
photon pairs from SPDC, but some papers representative of the 
scope of research are: (Ali Khan and Howell, 2006; Barreiro et al., 
2008; Bennink et al., 2004; Brougham and Barnett, 2012; Edgar 
et al., 2012; Howell et al., 2004; Howland and Howell, 2013; Leach 
et al., 2012; Mazzarella et al., 2013; Moreau et al., 2012; Reid 
et al., 2009; Schneeloch et al., 2013; Tasca et al., 2011; Walborn 
et al., 2010, 2011) 

2 In order to produce SPDC, one does not necessarily need a bire¬ 
fringent crystal, but this is a popular way to ensure a constant 
phase relationship (also known as phase matching) between the 
pump photon, and the signal/idler photon pair. 
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“signal” and “idler” photons. In particular, the pump 
field interacts coherently with the electromagnetic quan¬ 
tum vacuum via a nonlinear medium in such a way that 
as an individual event, a pump photon is destroyed, and 
two daughter photons (signal and idler) are created (this 
event happening many times). As this process is a para¬ 
metric process (i.e., one by definition in which the initial 
and final states of the crystal are the same), the total 
energy and total momentum of the field must each be 
conserved. Because of this, the energies and momenta 
of the daughter photons are highly correlated, and their 
joint quantum state is highly entangled. These highly en¬ 
tangled photon pairs may be used for any number of pur¬ 
poses, ranging from fundamental tests of quantum me¬ 
chanics, to almost any application requiring (two-party) 
quant um ent anglement. 

In this tutorial, we discuss a particularly convenient 
and common variety of SPDC used in quantum optics ex¬ 
periments. In particular, we consider illuminating a non¬ 
linear crystal with a collimated continuous-wave pump 
beam, and filtering the downcoverted light to collect only 
those photon pairs with frequencies nearly equal to each 
other (each being about half of the pump frequency). 
This degenerate collinear SPDC process is amenable to 
many approximations, especially considering that most 
optical experiments are done in the paraxial regime, 
where all measurements are taken relatively close to the 
optic axis (allowing many small-angle approximations). 
With this sort of experimental setup in mind, we dis¬ 
cuss the theoretical treatment of such entangled photon 
pairs (from first principles) in sufficient detail so as to 
inform the understanding and curiosity of anyone seek¬ 
ing to discuss or undertake such experiments 3 . Indeed 
as we discuss all the necessary concepts preceding each 
approximation, much of this discussion will be useful in 
understanding non-collinear and non-degenerate SPDC 
as well. 

The rest of this paper is laid out as follows. In Section 
2, we discuss the derivation of the quantum biphoton field 
state in SPDC, as discussed in (Hong and Mandel, 1985), 
and (Mandel and Wolf, 1995). In addition to this, we 
point out what factors contribute not only to the shape 
of the biphoton wavefunction (defined later), but also to 
the magnitude of the amplitude for the biphoton genera¬ 
tion to take place. This is important, as it determines the 
overall likelihood of downconversion events, and gives im¬ 
portant details to look for in new materials in the hopes 
of creating brighter sources of entangled photon pairs. 
In Section 3, we simplify the biphoton wavefunction for 


3 For more extensive treatments of spontaneous parametric down- 
conversion, we recommend the Ph.D. theses of Lijun Wang 
(Wang, 1992), Warren Grice (Grice, 1997), and Paul Kwiat 
(Kwiat, 1993), as well as the Physics Reports article by S.P. 
Walborn et al.(Walborn et al., 2010). 


the case of degenerate, collinear SPDC in the paraxial 
regime using the results in (Monken et al., 1998). We 
also use geometrical arguments to explain the approx¬ 
imations allowed in the paraxial regime. In Section 4, 
we show how to further approximate this approximate 
biphoton wavefunction as a Double-Gaussian (as seen in 
(Law and Eberly, 2004) and (Fedorov et al., 2009)), as 
the multivariate Gaussian density is well studied, and is 
easier to work with. In doing so, we give derivations of 
common statistical parameters of the Double-Gaussian 
wavefunction, showing its convenience in multiple appli¬ 
cations. In Section 5, we provide a calculation of the 
transverse correlation width, defined as the standard de¬ 
viation of the transverse distance between the signal and 
idler photons’ positions at the time of their creation. In 
Section 6, we explore the utility of the transverse corre¬ 
lation width, and introduce the concepts of the bipho¬ 
ton birth zone, and of the birth zone number as a mea¬ 
sure of biphoton correlation. We conclude by using the 
birth zone number to gain a qualitative understanding of 
the tradeoff between the first-order spatial coherence and 
the measurable correlations between photon pairs in the 
downconverted fields. 

II. FOUNDATION: THE QUANTUM - OPTICAL 
CALCULATION OF THE BIPHOTON STATE IN SPDC 

The procedure to quantize the electromagnetic field as 
it is used in quantum optics (Loudon, 2000; Mandel and 
Wolf, 1995) (as opposed to quantum field theory), is to: 
decompose the electromagnetic field into a sum over (cav¬ 
ity) modes; find Hamilton’s equations of motion for each 
field mode; and assign to the classically conjugate vari¬ 
ables (generalized coordinates and momenta), quantum- 
mechanically conjugate obervables, whose commutator is 
ih. From these field observables, one can obtain a Hamil¬ 
tonian operator describing the evolution of the quantum 
electromagnetic field, and in so doing, describe the evo¬ 
lution of any quantum-optical system. 

SPDC is a -nonlinear process. To describe it (Man- 
del and Wolf, 1995), we begin with the classical Hamil¬ 
tonian of the electromagnetic field; 

n EM = \jd 3 r (D-E + B-H), (1) 

where D = eoE + P. Since the electric field amplitude of 
the incident light on a nonlinear medium is usually sub¬ 
stantially smaller than the electric field strength binding 
the atoms in a material together, we can express the po¬ 
larization field P as a power series in the electric field 
strength (Boyd, 2007), so that 

P = eo [x (1) E + X (2) (E) 2 + X (3) (E) 3 + ...]. (2) 

Since the nonlinear interaction beyond second order is 
considered here to not appreciably affect the polariza- 
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tion, the classical Hamiltonian for the electromagnetic 
field can be broken up into two terms, one linear, and 
one nonlinear; 

H-em = Hl T T~LnLi (3) 


where, 

Knl = \e a J d 3 r Ei (?, t)E 3 (r, t)E t (r, t). (4) 


Next, since the nonlinear susceptibility xf-i depends on 
pump, signal, and idler frequencies 4 , each of which are 
determined by their respective wave numbers, the nonlin¬ 
ear Hamiltonian is better broken down into its frequency 
components: 


E-nl = J d 3 r ^ [x$(w(k p ), w(ki), w(k 2 )) 

( 5 ) 


k p ,ki,k 2 

x J B i (w(k p ))^(w(ki))E;(w(k 2 ))], 


where subscripts 1 and 2, are understood to refer to signal 
and idler modes, respectively. 

To condense this paper, we note that when the field 
quantization is carried out, our electric field functions 
E( r, t) are replaced by the field observables E{ r, f), which 
separate into a sum of positive and negative frequency 
contributions E + (r,t), and E~(r,t), where 


E + (r,t) = —r 


I Huj( k) 


Vi 4r^ 


2en 


,(*) e "k 


Jkr 


( 6 ) 


and E~ (r, t) is the hermitian conjugate of E + ( r, t). Here, 
s is an index indicating component of polarization, e is a 
unit polarization vector, and fig s (t) is the photon anni¬ 
hilation operator at time t. In addition, V is the quan¬ 
tization volume 5 , which in the standard quantization 
procedure, would be the volume of a cavity which can be 
taken to approach infinity for the free-space case. 

With the electric field observables thus defined, we can 
obtain the quantum Hamiltonian of the electromagnetic 
field: 


& = H ,a 

k,s 

+ \ 6 ° J (?>*)> (7) 


4 At this point we would like to point out that we use the Einstein 
summation convention for x\ji Ei (r, t)Ej (r, t)Ei (r, t). 

5 The quantization volume is the volume of the hypothetical cavity 
containing the modes of the electromagnetic field. For simplicity, 
the cavity is taken to be rectangular, so the sum over modes is 
straightforward using boundary conditions in Cartesian coordi¬ 
nates. To get an accurate representation of the electromagnetic 
field in free space, we may take the quantization volume to be 
arbitrarily large. 


where the first term is the linear contribution to the 
Hamiltonian. Though this Hamiltonian looks relatively 
simple, the field operator E(r,t) = E + (r,t) + E~(r, t), 
so that the integral in (7) actually contains eight terms. 
These terms correspond to all different x 2 processes (e.g., 
sum-frequency generation, difference-frequency genera¬ 
tion, optical rectification, etc.), each of which has its 
own probability amplitude of occurring. However, given 
that we have a single input field (i.e., the pump field), 
and start with no photons in either of the signal and 
idler fields, the only energy-conserving contributions to 
the Hamiltonian (i.e., the only significant contributions 
6 ) are transitions (forward and backward) where pump 
photons are annihilated, and signal-idler photon pairs are 
created. 

Our first approximation (beyond what was done to 
get (7) to begin with) is that the pump beam is bright 
enough to be treated classically, and that the pump inten¬ 
sity is not significantly diminished due to downconversion 
events. This “undepleted pump” approximation, along 
with keeping only the energy-conserving terms, gives us 
the simplified Hamiltonian: 

H = H L +^e Q J d 3 r (x[fi (?)£)(?, t)Ej (r, t)Ef (r, t)+h.c ), 

(8) 

which we then expand in the modes of the signal, and 
idler fields; 


H = H l 


+ \ £ ° J d3r (t ^ H X$(r;w(k^),w(ki),w(k 2 )) 


ki.si k 2 ,s 2 


4 1>sl w4 2lS2 w +h - c )- ( 9 ) 


Note that here and throughout this paper, h.c. stands for 
hermitian conjugate. 

Next, we assume the pump is sufficiently narrowband, 
so that we can, to a good approximation, separate out 
the time dependence of the pump field as a complex ex¬ 
ponential of frequency lj p . In addition, we assume the 


6 The reason the non-energy-conserving terms in (7) can be ne¬ 
glected is due to the rotating wave approximation. In calculat¬ 
ing the amplitude for the downconversion process, and converting 
to the interaction picture, all other contributions to this ampli¬ 
tude will have complex exponentials oscillating much faster than 
Auj = u)\ +CJ 2 — cop. Since each of these contributions (oscillating 
at frequency uj) when integrated give amplitudes proportional 
to sinc(^y^), and the propagation time T through the crystal 
is fixed, these Sine functions become negligibly small for large 
uj. Since A uj is small for nearly degenerate SPDC, the energy- 
conserving contribution dominates over the non-conserving con¬ 
tributions. 
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pump field to be sufficiently well-collimated so that, to 
a good approximation, we can also separate out the lon¬ 
gitudinal dependence of the pump field At this point, 
we define the transverse momenta q p , q 1; and q 2 ,as the 
projections of the pump wave vector k p , the signal wave 
vector ki, and the idler wave vector k 2 , onto the plane 
transverse to the optic axis respectively. We also define 
k pz , k-\ z , and fc 2 z, as the longitudinal components of the 
corresponding wave vectors. With this in mind, we ex¬ 
press the pump field as an integral over plane waves: 


(such that d 3 r = dxdydz), to get the Hamiltonian: 
H = H l 


+ 4n e °! d2qp 


LxLyLz 
V 


Y2 


ki,sik 2 ,S2 


X (^k p M^ki,si)j(^k 2 ,S2^ 


' ^ 2 o;(ki)a;(k2) 

4^ 


x sine 


^QxL/ x \ . f' Z L / 


sme 


V 2 


sme 


z 'e- i ”* t E(Q p ,t) 


V 2 


E ^ t) = ^ 


d"q p Ai(q p , t)e 




■F)g i(k zp z-u>pt) 


( 10 ) 


x “c W 

ki,«i k 2 ,s 2 


h.t 


( 12 ) 


By separating out the transverse components of the 
wave vectors, we make the Hamiltonian easier to sim¬ 
plify in later steps. As one additional simplification, 
we define the pump polarization vector eg , so that 
Ei(q p ,t) = -E(q p ,i)(eg )*. With the transverse compo¬ 
nents separated out, and the narrowband pump approx¬ 
imation made, the Hamiltonian takes the form: 


H = Hr 


+ fe./ d 3 rd 2 q p (v" Y2 Y2 X^ } (r;w(kp),w(ki),o;(k2)) 


ki.si k 2 ,s 2 




I h 2 Lo(ki)uj(k2) 

K 


X e -i( A q)-? e -i A fc,* e -fa, P t^ (^ t )gt (i)at (t) +h.c\ 

‘ ki,«i k 2 ,s 2 J 

( 11 ) 


where we define Aq = qi + q 2 — q p , and A k z = k\ z + 
k2z ~~ k pz • 

In most experimental setups (including the one we con¬ 
sider here), the nonlinear crystal is a simple rectangular 
prism, centered at r = 0, and with side lengths L x , L y . 
and L z . Here, we assume the crystal is isotropic, so that 
Xiji does not depend on r. To simplify the subsequent 
calculations, we assume the crystal to be embedded in a 
linear optical medium of the same index of refraction to 
avoid dealing with multiple reflections. Alternatively, we 
could assume the crystal has an anti-reflective coating to 
the same effect. We can then carry out the integral over 
the spatial coordinates (from —4 to ^ in each direction) 


7 For a reference that examines in detail how the downconverted 
light is affected by the pump spatial profile, we recommend the 
reference (Pittman et al., 1996a). For a reference that treats 
SPDC with short pump pulses (as opposed to continuous-wave), 
see (Keller and Rubin, 1997). 


Note that the Sine function, sinc(a’), is defined here as 
sin(a;)/a;. 

To obtain the state of the downconverted fields, one 
can readily use first-order time-dependent perturbation 
theory. To see why this is, we can compare the nonlin¬ 
ear classical Hamiltonian to the linear Hamiltonian using 
typical experimental parameters of a pump field inten¬ 
sity of lmIT/m?n 2 , and signal/idler intensities of about 
IpW/mm 2 . As such, the nonlinear contribution to the 
total Hamiltonian is indeed very small relative to the lin¬ 
ear part, and the consequent results we obtain from these 
first order calculations should be quite accurate. Though 
this is the treatment we discuss, alternative higher order 
and non-perturbative derivations of the quantum state 
of down-converted light are also useful in examining the 
photon number statistics of down converted light, par¬ 
ticularly when a sufficiently intense pump beam makes 
it significantly probable that multiple pairs will be gen¬ 
erated at once through the simultaneous absorption of 
multiple pump photons. Indeed, the general two photon 
state is described as a multimode squeezed vacuum state, 
whose photon number statistics have been shown experi¬ 
mentally (and theoretically) to be such that the number 
of pairs created in a given time interval is approximately 
Poisson-distributed (Avenhaus et al ., 2008; Christ et al ., 
2011 ). 

Using first-order time-dependent perturbation theory, 
the state of the signal and idler fields in the interaction 
picture can be computed as follows: 

Mt)) « (i - { J* dt'H NL (t o) |*(0)>. (13) 

Note that in the interaction picture, operators evolve ac¬ 
cording to the unperturbed Hamiltonian, so that a\t) = 
a^(0)e*“*. Here, the initial state of the signal and idler 
fields l’I'(O)) is given to be the vacuum state |0i,0 2 ), 
which means that the hermitian conjugate (with its low¬ 
ering operators) will not contribute to the state of the 
downconverted photon fields. 

Before we calculate the state of the downconverted 
photon fields (up to a normalization factor), we make 
use of some simplifying assumptions. First, we assume 



5 


that the polarizations of the downconverted photons are 
fixed, so that we can neglect the sums over Si and s 2 , 
effectively making the sum over one value. With this, 
the sum over the components of the nonlinear suscep¬ 
tibility is proportional to the value d e g = ^X'eS > which 
is the effective experimentally determined coefficient for 
the nonlinear interaction. Second, we assume that the 
nonlinear crystal is much larger than the optical wave¬ 
lengths considered here, so that the sums over ki and k 2 
can be replaced by integrals in the following way: 


lim "S'] 

V—>oo V _f "— J 

ki,si 



(14) 


With these simplifications, we can express the nonlinear 
Hamiltonian in such a way that both the sums are re¬ 
placed by integrals, while still accurately reflecting the 
relative likelihood of downconversion events; 


H N l « C NL d ef f j j d i kid i k 2 y w(ki)w(k 2 ) 

r \ | f p( 

■ip 


x / d 2 q p 


n 

m —1 


sine 


E(q p ,t)e iAult 


x a t (ki)a t (k 2 ), 


(15) 


where m = { 1,2,3} = {x, y , z \, and Aw = u>\ + u> 2 — w p , 
and Cnl is a constant. Note that here, A k x = A q x and 
A k y = A q y , to condense notation. 

With one additional assumption, that the slowly- 
varying pump amplitude (excluding e lulpt ) is essentially 
constant over the time light takes to propagate through 
the crystal, the integral over this nonlinear Hamiltonian 
becomes an integral of a constant times e lAult . With this 
integral, we get our first look at the state of the down- 
converted field exiting the crystal: 


I^SPnc) ~ <7 0 |Oi,0 2 > 

+ C\d eff \J I P T 2 J j d 3 k x d 3 k 2 $(ki,k 2 ) 

x y /, w(ki)w(k 2 )e l4 r L sinc Ty-^ a t (ki)a t (k 2 )|0i, 0 2 ). 

(16) 

Here, T is the time it takes light to travel through the 
crystal; I p is the intensity of the pump beam; |0i,0 2 ) is 
the (Fock) vacuum state with zero photons in the signal 
mode and zero photons in the idler mode, and; 


r 3 


$(ki,k 2 ) = 


d 2 


dp 


n 

m=l 


sme 


f 

\ 2 


^(q P ), 


(17) 


is, up to a normalization constant, the biphoton wave- 
function in momentum space (where i/(q p ) is the nor¬ 
malized pump amplitude spectrum). To see how this 


works, we note that the biphoton probability amplitude 
can be expressed as (Oi, 0 2 |a(ki)a(k 2 )|'l'sp£>c'). When 
we normalize this probability amplitude, by integrating 
its magnitude square over all values of k l5 and k 2 , and 
setting this integral equal to unity, the resulting normal¬ 
ized probability amplitude has the necessary properties 
(for our purposes) of a biphoton wavefunction 8 . With 
approximations to be made in the next section, only 
$(k l5 k 2 ) will govern the transverse momentum proba¬ 
bility distribution of the biphoton field 9 . 

The factors preceding the biphoton wavefunction are 
still important to understand because (with some alge¬ 
bra) they contribute to the rate of downconversion events 
( Rspdc )• In particular 10 : 

Rspdc oc dlffP p Ll, (18) 

where P p is the pump power (in Watts). We also 
note from equation (16) that the rate of downconver¬ 
sion events is also proportional to sinc 2 (AwT'/2), though 
this factor is essentially unity for the nearly degenerate 
frequencies of the signal and idler photons considered 
here. This proportionality also follows from more rig¬ 
orous calculations of the rate of downconversion events 
(Hong and Mandel, 1985; Kleinman, 1968), though only 
in the approximation where the minuscule signal/idler 
fields don’t appreciably contribute to the likelihood of 
downconversion events. In those more rigorous calcula¬ 
tions, the conversion efficiency (biphotons made per in¬ 
cident pump photon) is of the order 1CP 8 , which again 
shows just how weak these signal/idler fields are relative 
to the pump field 11 , and why first-order perturbation 
theory is sufficient to get a reasonably accurate represen¬ 
tation of the state of the downconverted fields. We also 
note that although beam size doesn’t affect the global 
rate of downconversion events, it does affect the frac¬ 
tion of those downconversion events that are likely to be 
counted by a detector near the optic axis 12 . Even so, 


8 Though it is debatable whether it is correct to speak of a bipho¬ 
ton wavefunction since expectation values are in fact carried out 
with Ty p no) • and (k ]. ko j does not evolve according to the 
Schrodinger equation (j'T'yp oc) does, though), <t>(ki,k 2 ) is a 
square-integrable function in a two- particle joint Hilbert space 
that accurately describes the relative measurement statistics of 
the biphotons. 

9 Note that the biphoton wavefunction (17) is expressed as an 
integral over the rectangular crystal shape. For those interested 
in a derivation of the integral giving the biphoton wavefunction 
for a generalized crystal shape, see (Saldanha and Monken, 2013) 

10 Note that this downconversion rate comes from the approxima¬ 
tion of near-perfect energy conservation: i.e., A uiT << 1. (Helt 
et al., 2012) 

11 Including the collection/coupling efficiencies in many quantum- 
optical experiments, the measured conversion efficiency is closer 
to 10 _1 °. 

1 - The rate of downconversion events yielding biphotons propagat¬ 
ing close to the optic axis increases with a smaller beam size, 
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these factors are useful to know when selecting a crystal 
as a source of entangled photon pairs. For example, with 
a constant power pump beam, a longer crystal will be 
a brighter source of photon pairs. However, there is a 
tradeoff; the degree of correlation between the signal and 
idler photons decreases with increasing crystal length (as 
we shall show). 



FIG. 1: Diagram of the relationship between the pump, 
signal, and idler momenta, in standard (not necessarily 
collinear) phase matching. 


III. APPROXIMATION FOR DEGENERATE COLLINEAR 
SPDC 

To obtain a relatively simple expression for the bipho¬ 
ton field in SPDC, we have made multiple (though rea¬ 
sonable) simplifying assumptions. We have assumed that 
the pump is narrowband and collimated so that it is 
nearly monochromatic, while also having a momentum 
spectrum whose longitudinal components dominate over 
its transverse components. We next assumed that the 
pump is bright enough to be treated classically, but not 
so bright that the perturbation series approximation to 
the nonlinear polarization breaks down. In addition, we 
assumed that we need not consider multiple reflections, 
and that the crystal is large compared to an optical wave¬ 
length so that sums over spatial modes may be replaced 
by integrals. We have also assumed that the pump is 
bright enough that it is not attenuated appreciably due 
to downconversion events. 

Now, we consider the experimental case where we place 
frequency filters over photon detectors, so that we may 
only examine downconversion events which are degen¬ 
erate (where cj% = w 2 ),and perfectly energy-conserving 
(Aw = 0). In this case, along with all the previous as¬ 
sumptions made, we define a new constant of normal¬ 
ization Ci (absorbing factors outside the integrals), and 
obtain the following simplified expression for the state of 
the downconverted field as seen in (Monken et al. , 1998); 

I'I'spdc) ~ C*o101,0 2 ) (19) 

+Ci JJ d 3 k\d 3 k 2 $(ki, k 2 )a^(k 1 )a^(k 2 )|0i, O 2 ). 


Here, the biphoton wavefunction $(ki,k 2 ) is as defined 
previously (17). Next, we use the fact that the transverse 
dimensions of the crystal are much larger than the pump 
wavelength to carry out the integral over the transverse 


but only to a point. For a good summary, see (Ling et al., 2008). 
For a more detailed discussion on how focusing affects the frac¬ 
tion of downconverted light propagating near the optic axis, see 
(Ljunggren and Tengner, 2005). For a more rigorous discussion 
of how the rate of downconversion events (i.e., the signal/idler 
power) changes with the crystal length, see (Loudon, 2000). 


pump momentum. 


<f>(ki, k 2 ) = sine 


^ A k z L z 


d P 


sme 


^ A k x L x ^ 


sme 


/ A k y Ly 

V 2 


v(kpxi kpy)i 


( 20 ) 


The significant contributions of the Sine functions to the 
integral will come from when, for example, A k x < . 

where D is the ratio of the width of the crystal L x to 
the pump wavelength X p . Where the crystal is much 
wider than a pump wavelength, Q is large, and we see 
the Sine function will only contribute significantly when 
Ak x is only a very small fraction of k p . Thus, with a 
renormalization, the sines act like delta functions, setting 
q P = qi + q 2 , and giving us the biphoton wavefunction: 

$(ki,k 2 ) = Vsinc^^—^z/(qi +q 2 ), (21) 

where JV is a normalization constant. 

Since most experiments are done in the paraxial 
regime, we use such approximations to get the Sinc- 
Gaussian biphoton wavefunction, ubiquitous in the lit¬ 
erature. With the previous assumptions already made, 
we point out that in degenerate, collinear SPDC, |ki| = 
|k 2 1 = |k p |/2, which we redefine as fci,fc 2 , and k p / 2, to 
simplify notation. In addition, since the transverse pump 
momentum is essentially equal to the sum of the trans¬ 
verse signal and idler momenta, the three vectors can be 
readily drawn on a plane, as seen in Fig. 1. 

Let 6 be the angle between the pump momentum k p 
and the signal or idler momentum vectors ki and k 2 . 
This angle 9 is small enough that we may use the small- 
angle approximation to find an expression for A k z in 
terms of easier-to-measure quantities. Using the conser¬ 
vation of each component of the total momentum, we get 
the following equations: 


k p = (fci + k 2 ) cos (9) — A k z 


( 22 ) 


|qi - q 2 | 


= k\ sin(0). 


(23) 


Using the small-angle approximation, and substituting 
one equation into the other, we find: 

|2 


Afc, « - 


|qi - q 2 | 
2 k„ 


(24) 
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Finally, when we assume the transverse pump momentum 
profile is a Gaussian, 

Kq p)= ( 2 ^) 4 e"^ 1 ^ 2 , (25) 

with (jp being the pump radius in position space 13 , we 
renormalize, and find the biphoton wavefunction to be: 

|qi-q 2 | 2 )e-^^+^l 2 , (26) 

where the minus sign in the argument of the sine function 
is eliminated since the sine function is an even function. 

To obtain a transverse correlation width from this 
biphoton wavefunction, we need to transform it to po¬ 
sition space. Fortunately, this biphoton wavefunction 
is approximately 14 separable (subject to our paraxial 
approximation) into horizontal and vertical wavefunc- 
tions (i.e., into a product of functions, one dependent 
on only x-coordinates, and the other dependent on only 
y-coordinates). In addition, we can find an orthogonal 
set of coordinates in terms of sums and differences of 
momenta that allows us to transform this wavefunction 
by transforming the Sine function and Gaussian indi¬ 
vidually. While transforming the Gaussian is extremely 
straightforward, transforming the concurrent Sine-based 
function is more challenging, owing to that it is a Sine 
function of the square of a momentum coordinate, and is 
not in most dictionaries of transforms. 

IV. THE DOUBLE-GAUSSIAN APPROXIMATION 

In what follows here, we approximate the Sinc- 
Gaussian biphoton momentum-space wavefunction (26) 
as a Double-Gaussian function (as seen in (Law and 
Eberly, 2004) and(Fedorov et al ., 2009)), by matching 
the second order moments in the sums and differences 
of the transverse momenta. Transforming this approxi¬ 
mate wavefunction to position space, and computing the 
correlation width gives us an estimate of the true corre¬ 
lation width seen experimentally that we later compare 
with more exact calculations and experimental data. In 


13 The pump radius a v in position space is defined as the standard 
deviation of , ' 1 (j ,T2 . This is justified by noting that the pump 
radius in momentum space is explicitly given by the standard 
deviation of (k\ x + k 2 x), and using the properties of Fourier 
transformed Gaussian wavefunctions. 

14 For small values of x and y, sine (a; + y) ~ sirK:(x)siric(y). For 
typical experimental parameters, the argument of the Sine func¬ 
tion is of the order 10 3 , even for transverse momenta as large 
as the pump momentum. With the paraxial approximation, the 
transverse momenta are much smaller than the pump momen¬ 
tum, and so the arguments of the Sine functions are very small 
indeed. 


addition, we take a moment to explore the conveniences 
that come with the Double-Gaussian wavefunction. 

In this analysis, we consider only the horizontal com¬ 
ponents of the transverse momenta, since the statistics 
are identical (with our approximations) in both trans¬ 
verse dimensions. The transverse pump profile is already 
assumed to be a Gaussian. Our first step is to trans¬ 
form to a rotated set of coordinates to separate the Sine 
function from the Gaussian. 

Let 

k + = hx + r k2x and k- = fcl3: ~ fc2x . (27) 

v 2 v 2 

With these rotated coordinates, the (horizontal) bipho¬ 
ton wavefunction becomes: 

(j>(k+,k-) = ■A/'sinc^ '^*'^’ k^J e~ 2<7 p k +. (28) 

Taking the modulus-squared and integrating over k+, 
we isolate the probability density for fc_: 

p(k-) = ^ ^/^sinc 2 (ak 2 _) (29) 

where a = L ^ p , for convenience. p(k~) is an even func¬ 
tion, so its first-order moment, the expectation (fc_) = 0. 
The second-order moment is nonvanishing, with a value 
(k^_) = With this second order moment, we can fit 
p{k~) to a Gaussian by matching these moments. In do¬ 
ing so, p{k -) is approximately a Gaussian with width 



To see how good this Gaussian approximation of p(k_) 
is, we show in Fig. 2, both the Sine-based probability den¬ 
sity (in momentum space) and the approximate Gaussian 
density with matched moments. The overall scale of the 
central peak is captured but the shape is significantly dif¬ 
ferent. However, a Gaussian probability density for the 
position difference density, p(x-), appropriately scaled is 
a good approximation for values near the central peak 
(though the oscillatory behavior of the wings is still not 
captured). In Fig. 3, we plot various choices of an approx¬ 
imate Gaussian density for our transformed Sine-based 
position difference density function p(x-). We find that 
simply setting the central maxima of both densities equal 
to each other works very well, as discussed in the next 
section. 

By approximating the Sinc-Gaussian wavefunction as a 
Double-Gaussian wavefunction, the inverse Fourier trans¬ 
form to position space becomes very straightforward. We 
note that k + and fc_ form an orthogonal pair of coordi¬ 
nates, as k\ x and k 2 X do. Because of this, the inverse 


$(ki,k 2 ) =A/'sinc^ Z |^ 



p(fc-) 




FIG. 2: Plot comparing estimates of the momentum 
difference probability density p(fc_). The solid (blue) 
curve with wavy side-bands gives our Sine-based 
probability density estimate (29), where we set a = 2 
for convenience. The dashed (magenta) Gaussian curve 
gives our Gaussian-based probability density estimate 
with matching means and variances. 


Fourier transform is separable 15 , and we find: 
1 


ip(x+,x-) 


V 2 




X+ X — 


___+ 

4a 2 

e ”+ 


where a? = -r\ 


1 n 7 5 %+ 4cr 


X+ = 


XI + x 2 


and X- = 


X\ - x 2 


(30) 


= \— = 2er):, and where 


(31) 


A. Usefulness of the Double-Gaussian approximation 

Here, we digress to discuss the usefulness of the 
Double-Gaussian approximation. To begin, we express 
x+ and X- in terms of x\ and x 2 , and take the magnitude- 
squared of i/j( x\,x 2 ) to get a Double-Gaussian probability 
density p DG : 


FIG. 3: Plot comparing different estimates of p{x_). 
The solid blue wavy curve is our most accurate estimate 
from the transformed Sine-based distribution (51). The 
tall dashed (magenta) curve is the Gaussian distribution 
obtained from matching momentum means and 
variances, while the shallow dashed (red) curve is the 
Gaussian distribution obtained by matching position 
means and variances. The solid (green) curve gives us a 
refined Gaussian approximation, by setting the central 
maximums equal to one another. The flat (gold) line, 
gives the height of the half maximum of the Sine-based 
probability density (51) (blue curve). We see that the 
widths of half maximum are nearly identical (off by less 
than 0.3%) for the Sine-based and refined Gaussian 
distributions Again, we set a = 2 for convenience. 


Here, we define the transverse correlation width as the 
standard deviation of the distance between x\ and x 2 
(i.e., rr(x-i-x 2 ))- This is not defined as a half-width, since 
it represents the full width (at of the maximum) of 
the signal and idler photons’ position distributions con¬ 
ditioned on the location of the prior pump photon (see 
Section VI). For the Double-Gaussian density, the trans¬ 
verse correlation width, a( Xl _ X2 ' ) , is V2a x _. 

Alternatively, the Double-Gaussian density can be put 
into the standard form of a bi-variate Gaussian density 
function; 


p L>G (xi,x 2 ) = 


2'xa x+ a x _ 


<*1-^2 > 


(®l+®2 L 


■ (32) 


p DG {:ii,x 2 ) 
where 


Vac — b 2 \ 

e 


ax\-\-2bx\X2-\-cx%) 


(33) 


15 The Fourier transform convention we use is the unitary conven¬ 
tion: l ip(kix,k 2 x) = ^ ff dx\dx 2 e~’ l ^ Xlklx+X2k2x ^(x 1 , 012 ), 
and ip{x\,X 2 ) = ^ ff dk\ x dk 2 x e z ( Xlklx+X2k2x ' ) 'ip(ki x ,k 2 x)- 
Since the Fourier transform is invariant under rotations (i.e., 
since the argument in the exponential can be thought of as an 
inner product between two vectors), we get identical formulas for 
the Fourier transform in rotated coordinates. In particular, we 
find ^(®+,«_) = ±ffdk+dk- e^ x + k + +x - k -^(k + ,k-). 


a = 

b = 


_ + al_ 

4o ’£ + °'L 

•2 


4(7^ 


(34) 

(35) 


This Double-Gaussian probability density has a number 
of useful properties. First, it is separable into single 
Gaussians in rotated coordinates, making many integrals 
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FIG. 4: Plot of the Double-Gaussian probability density 
for <j x , = 1 unit and a x _ = 0.075 units. The horizontal 
dotted line indicates a particular value of x 2 , so that 
the half-width of the Gaussian along that dotted line is 
the conditioned half-width ar Xl \ X2 y The equivalent 
Sinc-Gaussian distribution will have subtle side bands 
(at about 5.0% the maximum intensity) parallel to the 
long axis of this Double-Gaussian as in Fig. 3. 

straightforward to do analytically. Second, the marginal 
and conditional probability densities of the Double Gaus¬ 
sian density function are also Gaussian density functions. 
Because of this, many statistics of the Double-Gaussian 
density have particularly simple forms. For examples, 
consider the statistics in Table 1. 

In addition, the Double-Gaussian is uniquely defined 
by its marginal and conditioned means and variances. As 
seen in Fig. 4, these values give a straightforward char¬ 
acterization of the overall shape of the Double-Gaussian 
distribution. 

1. Fourier-Transform Limited properties of the Double-Gaussian 

Gaussian wavefunctions are minimum uncertainty 
wavefunctions in that they are Fourier-transform limited; 
Heisenberg’s uncertainty relation; 

ctzCTfc > (36) 

is satisfied with equality. The Double-Gaussian wave- 
function (32) factors into a product of two Gaussians 
(one in x+ and the other in x_), and so the standard 
deviations of these rotated coordinates also saturate the 


TABLE I: Statistics of the Double Gaussian 


Name 

Value 

marginal means 

(* 1 ) = ( X2 ) = 0 

conditioned mean 

2xi 

(^ 2 ) p(x2 |cci) X 1 0-2 10-2 TX\ 

X_|_ 1 X _ 

marginal variance 

_2 1 _2 

2 2 

u xi u x 2 2 

conditioned variance 

_2 _2 *<*+<*- 
a (x 1 \x 2 ) ~ a (x 2 \xi) — ct 2 +ct 2 

co-variance 

{X 1 X 2 } {X\)(X 2 ) = + 2 

Pearson r value 

CT^ 0.2 

- _ {xix 2 )-{x 1 ){x 2 ) _ x + 

l — — —- 

ct ^ 1 ct ^2 CT x + + t7 J_ 

joint entropy 

h{xi, x 2 ) = \og{2-Ke<j x+ a x _) 

marginal entropy 

h{x 1 ) = |log(7r e(al + + a x _)); 


= h(x 2 ) 

mutual information 

h(xi:x 2 ) = h(xi)+h(x 2 ) — h(xi, x 2 )\ 
= 1 °g(2a!: + a x ) = 1 °g( CT(xi| 1 a , 2) ) 

Probability notation 

2 _ 2 
(J {x 1 \x 2 ) — a p(x i\x 2 ) 


Heisenberg relation: 

= 7 ^ : Gx_®k_ = 2 ' ( 37 ) 

Remarkably, the simple expressions for the statistics of 
the double-Gaussian distribution (see Table I) show that 
these are not the only pairs that are related this way. 
Since conditioning measurements on a single ensemble of 
events A doesn’t change the fact that those measurements 
must satisfy an uncertainty relation, we find: 

> 2 ( 38 ) 
is still a valid uncertainty relation. In addition, since 
conditioning on average reduces the variance 16 , we arrive 
at the relations: 

&xi&(ki\k 2 ) > — . v kl cr( Xl \x 2 ) ^ ( 3 9 ) 

These relations are also useful for understanding how 
narrowband frequency filters undermine the resolution 
of temporal correlations as discussed in Appendix C. For 
the Double-Gaussian state, these relations are saturated 


16 That conditioning on average reduces the variance can be seen 
from the law of total variance. Given two random variables X 
and Y, the variance of X is equal to the mean over Y of the con¬ 
ditioned variance Var(X\Y) plus the variance over Y of the con¬ 
ditioned mean i?[A'|Y]. Both of these terms are non negative, so 
the mean conditioned variance never exceeds the unconditioned 
variance. 
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as well. From these properties, we may find many other Because our position-space wavefunction is also (ap- 
useful identities for the double-Gaussian including: proximately) separable into a product of vertical and 

horizontal wavefunctions, we can propagate ip DG {x\, x 2 ) 
r x = —Tk (40) (i. e ., the Double-Gaussian approximation to ip(xi,x 2 )), 

a x + _ Qfc- <T X1 _ a (k 1 \k 2 ) without having to first propagate the entire transverse 

a x _ <Xk + a (x 1 \x 2 ) crfcj wavefunction. Doing so, gives us: 


where r x and Tk is the Pearson correlation coefficient _ ** 1 * 10 , 2 fcjh 

for the position and momentum statistics of the Double- ip DG { x i, x 2 : z±, z 2 ) = T 1 [4> DG (ki x , k 2x )e k ? e k p ], 
Gaussian, respectively. (46) 

where J 7-1 is the inverse-Fourier transform operator. 
Perhaps surprisingly, propagating the double-Gaussian 
2. Propagating the Double-Gaussian field field simply gives another bi-variate Gaussian density 

(see in appendix, (Al)). What changes through prop- 
One especially useful aspect of the Double-Gaussian agation is the parameters defining the Double-Gaussian 

wavefunction, is that it is simple to propagate (in the field. Taking (30) to be the biphoton field backpropa- 

paraxial regime). Given the transverse momentum am- gated to the center of the crystal (where zi = z 2 = 0, 

plitude profile of a nearly monochromatic optical field in and a x+ and <j x _ to be parameters defining the field at 

one transverse plane, we can find the transverse momen- Zl = z 2 = 0, the transverse probability density of the 

turn profile at another optical plane by multiplying it by photon pair when the propagation distances are equal 

the paraxial free-space transfer function Tf s (z : k x ,k y ) (i.e., z\ = z 2 = z), as they would be if we measure both 

17 ■ fields is the same image plane, gives us: 


T fs (z:k x ,k v ) = e ikz -^ +k y\ ( 42 ) 

For an entangled pair of optical fields at half the pump 
frequency, the full transfer function becomes: 

Tfs(zi,z 2 : k\ xi k\ y , k 2x , k 2y ^) = 

= +fc? y ) e -^GL+^)_ (43) 

Since a global constant phase e *^4 Zl+Z2 ) can come out¬ 
side the Fourier transform integral, and the relative 
phases and amplitudes in position space will be inde¬ 
pendent of this factor, we can remove it from the trans¬ 
fer function, and express the remaining transfer function 
simply as a product of a horizontal and vertical transfer 
function: 

Tfs(zz 2 . k\ x , ki y , k 2x , k 2y ) 

— r kf sx [z\,z 2 : k\ x , k 2x )T f S y[z\, z 2 : ki y ,k 2y ^). (44) 

where 

iz ‘‘ _ iz 2 k 2:r 

Tf sx (z\,z 2 : k lx ,k 2x ) = e k p e k p , ( 45 ) 

and Tf sy {z\,z 2 : ki y ,k 2y ) is similarly defined. 

1 ' The free space transfer function comes about due to the momen¬ 
tum decomposition of an optical field being a sum (or integral) 
over plane waves. For each plane wave defined by k x , k y , and 
k z , we add a phase corresponding to the plane wave translat¬ 
ing a total forward distance z. The particular form of the free 
space transfer function used here is due to the small angle- or 
paraxial approximation. For a good reference on this topic, see 
(Goodman et al., 1968). 



This particularly illustrates the convenience of working 
with the Double-Gaussian density, as we need only find 
effective values for a x+ and <j x _ at one distance z to see 
how the biphoton field might change under propagation 
to the same imaging plane 18 . In particular, where a x+ is 
much larger than cr x _ for highly entangled light, we can 
see that the position correlations (say, as measured by 
the Pearson correlation coefficient) decrease to zero, and 
gradually become strong anti-correlations as we move to 
the far field. This does not imply, however, that the pho¬ 
ton pairs dis-entangle and re-entangle under propagation 
(Chan et al, 2007); the entanglement migrates to the 
relative phase of the joint wavefunction and back again. 
Throughout the rest of this paper, all transverse corre¬ 
lation widths, probability densities, and biphoton ampli¬ 
tudes will be assumed to be taken at z\ = z 2 = 0 (or an 
image plane conjugate to this plane), unless otherwise 
specified. 


18 On the other hand, propagating to independent imaging planes 
z\ and 22 is a more elaborate result discussed in the Appendix. 
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V. ESTIMATING THE TRANSVERSE CORRELATION 
WIDTH 

Using our earlier notation, the approximation to the 
Double-Gaussian wavefunction is expressed as follows: 


ip{x i,X 2 ) 


(x t -a 2 ) 


(x 1 +x 2 )' S 


2x7271 


(50) 


' p u X — 


where a x _ = 
width, cr( Xl - X2 ) 


making the transverse correlation 



To see just how good (or not) this Gaussian-based esti¬ 
mate of <T(xi-x 2 ) ^ s > we compare our Gaussian approxima¬ 
tion to the the probability density of X- obtained when 
taking the Fourier transform of the Sine-based function of 
k- (29). The more accurate probability density obtained 
from that Fourier transform is: 



where C(x) and S{x) are the Fresnel integrals, integrating 
over cos (ft 2 ) and sin(|f 2 ), respectively. As seen in Fig.3, 
the Gaussian approximation obtained by matching (fc_) 
and (A: 2 ) gives a full width at half maximum (FWHM) 
within an order of magnitude of the FWHM of the more 
accurate approximation (51). 


However, with a width a x _ of y ^ (i.e., by setting the 
maximums of our (Sine-based and Gaussian-based) ap¬ 
proximate density functions equal to one another), where 
again, a = p , one obtains a FWHM only 0.3% smaller 
than the FWHM from the more accurate case (51). In¬ 
deed, numerical estimates based on fitting the widths of 
the Sine-based density to the Gaussian density have been 
performed (Chan et al, 2007; Law and Eberly, 2004) to 
great effect (Edgar et al, 2012). Since choosing which 
width to fit is somewhat arbitrary, we point out that the 
peak-matching fit also fits the full width at 48.2% of the 
maximum. However, the best estimate of cr x _ is obtained 


by an explicit calculation of a x _ = ^ j(x^_) from the more 
accurate density (51). Remarkably, we find that a x _ is 
simply , which in turn gives us a transverse correla¬ 
tion width, <j( Xl _ X2 ), of y/In addition, matching this 
exact variance to define an approximate Double-Gaussian 
wavefunction also gives us the maximum likelihood esti¬ 
mate (i.e., the estimated distribution with minimum rel¬ 
ative entropy to the more accurate model) of a Double 
Gaussian distribution fitting our more exact results. As 


a summary of our calculations, see the following: 

(exact) pz (exact) / 18 a / 9L Z \ P 

Cr (xi-x 2 ) ~ ^ (Jx ~ ~ V 5 — V 107r ’ /Q2) 

APM) _ pz(PM) _ _ /4L Z \ P 1 } 

<J (x 1 -x 2 ) ~ v-^_ — y 9 - y 07r • 

Here, <j pm refers to the peak-matching estimate that also 
nearly matches the widths of the Gaussians, while o-( exact ) 
is our more accurate calculation. Both estimates have 
their uses when examining experimental data, as we shall 
see. 

Though explicitly calculating the variance of X- ac¬ 
cording to our accurate density function (51) gives us 
the best possible estimate of <j x _, it does not necessar¬ 
ily give us the best fitting Gaussian approximation to 
the Sine-based distribution. The Gaussian obtained by 
explicitly matching position means and variances, gives 
a distribution about 42.3% wider than the close fitting 
distribution we obtain by matching peak values (see Fig. 
3 for comparison). The resulting (overly wide) scaled 
Gaussian distribution (by matching variances) does not 
accurately reflect the probabilities of the most likely out¬ 
comes (near X- = 0) (e.g., that within ± one ’’sigma”, 
we should get approximately 68% of the total data). In¬ 
deed, by setting the central maximums equal to one an¬ 
other, we also find the Gaussian cumulative distribution 
function (CDF) that most accurately resembles the CDF 
of our more accurate distribution (51) near its median. 
As an example of the accuracy of this approximation, 
our peak-matching approximate Gaussian distribution, 
gives a total probability within one a x _ from the origin 
of 68.3%, while the more accurate density function gives 
a probability of 69.0%, (an absolute difference of only 
0.7%) over the same interval. 


A. Comparison with experimental data 

Though our estimate of a^ Xl _ X2 ) follows from reason¬ 
able approximations that work well within typical ex¬ 
perimental setups, it is important to show just how well 
(or not) this estimate of the transverse correlation width 
corresponds with experimental data. This can be done 
in (at least) two ways. First, in (Howell et al, 2004), 
they found a measure of the transverse correlation width 
by placing a 40/rrn slit in the signal beam, and scan¬ 
ning over the idler beam with another 40/zm slit (see 
Fig. 5 for a diagram of the idealized setup). By measur¬ 
ing coincident detections as they scan, and normalizing 
the resulting histogram, they measured the conditional 
transverse probability distribution, and obtained a con¬ 
ditional width, which is approximately identical to the 
correlation width 19 . With their measurements, they ob- 


19 When p(xi,X 2 ) = p(x+)p(X—) and a x _, it follows that 

<J (x 1 —x 2 ) ~ a (x 1 \x 2 )- 
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FIG. 5: Idealized diagram of an experiment (Howell 
et al ., 2004) to measure the transverse correlation 
width. The nonlinear crystal (NLC) is just after the 
laser, with a pump filter placed just after that. The 
beam is broken into signal and idler with a 50/50 
beamsplitter. There is a loss of coincidences due to the 
beamsplitter but this doesn’t affect the spatial intensity 
profile. Two lenses are used to image the exiting face of 
the nonlinear crystal onto the image planes where two 
slits are placed. With one slit fixed, the other mobile, 
and Avalanche Photo-Diodes (APDs) behind each slit, 
one can obtain the correlation width by comparing the 
width of the coincidence distribution to the width of the 
slits. 


prisingly, this agrees more with our peak-matching esti¬ 
mate of 15.8yLtm than with our ostensibly more accurate 
estimate of 22.5/xm (for these parameters). This how¬ 
ever is to be expected, as the fitting by its very nature 
gives a result whose shape most closely resembles the 
shape the data gives, and low-level noise will mask infor¬ 
mation about the distribution beyond the central peak. 
Future experiments with higher-resolution measurements 
are needed to better explore the strength of this approx¬ 
imation. 

The second way that one can use experimental data 
to place a limit on the transverse correlation width is to 
use the comparatively larger amount of data about tem¬ 
poral correlation widths. As an example, if one knew 
that in a single downconversion event, the photons were 
generated no further than lOOfs apart 90% of the time, 
then the speed of light assures us that the photon pair 
could be no farther than 30/im apart 90% of the time as 
well. Indeed, in (Ali Khan and Howell, 2006), they mea¬ 
sured approximately a 50fs time-correlation width (using 
our convention) with downconverted photons from the 
same 390nm pump laser incident on a 2mm long non¬ 
linear crystal. With this value, we can place an upper 
bound to the transverse correlation width of the light in 
that setup by 15jLtm, which is not substantially above our 
14.9^m estimate. 


VI. THE BIPHOTON BIRTH ZONE 


tained a transverse correlation width (adjusting for our 
conventions) of about 13.5/iin (with an estimated error 
larger than 10%), 9.6% less than our theoretical estimate 
(52) (using their pump wavelength of 390nm and crystal 
thickness of 2mm), of 14.9/im for the same experimental 
parameters. 

Another measurement with the same laser and crystal 
was taken in (Bennink et al., 2004), where they obtained 
a transverse correlation width of 17 ± 7/r?n. Given how 
these experiments’ resolutions were limited both by fi¬ 
nite slit widths 20 and a statistical uncertainty in <jt Xl _ X2 \ 
larger than 10%, our approximation is accurate to within 
experimental uncertainty. More recently, (Edgar et al., 
2012), an experiment was performed in which the joint 
position photon distribution was imaged with a camera. 
By fitting a Double-Gaussian to their empirical distri¬ 
bution, they found a correlation width of 10.9 ± 0.7/iin 
(for their 355nm pump beam and 5mm crystal). Sur- 


20 In order to obtain an estimate of 13.5/mi for the transverse cor¬ 
relation width using slits 40//rn wide, they deconvolved their co¬ 
incidence histograms with the slit rectangle function. This gave 
them more accurate estimates for the joint coincidence distribu¬ 
tions at arbitrary resolution from which they could obtain more 
accurate estimates of the transverse correlation width. 


When a photon pair is created in SPDC, the location 
of the pair production can be essentially anywhere in 
the crystal illuminated by the pump beam. The uncer¬ 
tainty in where this pair-production takes place is limited 
by the uncertainty in the location of the pump photon. 
However, for any given photon pair created, the mean 
separation between the two is generally much smaller 
than the uncertainty (i.e., standard deviation) in the ex¬ 
pected location of the downconversion event. It is useful 
to conceptualize the region surrounding that mean po¬ 
sition where the daughter photons are most likely to be 
found as what we shall call a birth zone. 

Given that momentum is very nearly conserved in 
SPDC, we define the expected (transverse) location of 
the downconversion event as the mean of the two pho¬ 
tons’ positions x m : 


_ X 1 +X 2 _ x + 

= 2 - 71' 


(53) 


With the Double-Gaussian wavefunction (50) as our 
model for transverse position statistics in SPDC, we find 
the standard deviation in the mean position a Xm to be: 

1 

— 2 a ( Xl + X2 ) — a P' 

In addition, we define the width of the pump, A p, to 
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FIG. 6: Idealized diagram of the transverse intensity 
profile (in both x and y) of the downconverted light just 
as it exits the crystal. The blue circle encapsulates the 
region within one standard deviation of the pump 
photon position (or also approximately the signal or 
idler photon position) from the beam center. The red 
circle is centered on a particular downconversion event, 
and encapsulates the region where the signal and idler 
photons are likely to be found given that their mean 
position is known to be at the canter of the circle (i.e., 
one birth zone). For a sense of scale, we let 
A p /A bz = 10. 

be twice this standard deviation. While the photon pair 
is expected to be created at x m with uncertainty <r p , the 
signal and idler photons, conditioned on their mean posi¬ 
tion being at some x' m , will each have position uncertain¬ 
ties a' Xl = a' X2 = (T x _/y/2. As such, we define the birth 
zone width, A bz, to be twice these conditioned position 
uncertainties 21 , giving us: 

A B z = 2a' Xl = V2a x _ = CT( Xl _ X2 ). (55) 

With these definitions, we take the birth zone num¬ 
ber N (which in one dimension is Ap/A bz or alterna¬ 
tively a x+ /a x _) to be a measure of the degree of corre¬ 
lation between the signal and idler photon fields 22 . The 


21 The equation c t 1 = \/2c r x _ holds only when p{x+,x~) is sep¬ 
arable into the product p(xp)p(x—), as is the case for both the 
Double-Gaussian and Sinc-Gaussian distributions. 

22 Although the birth zone number N is one dimension is de¬ 
fined as Ap/A.gz, we can express it simply in terms of the 


birth zone number N is not to be confused with the Fe¬ 
dorov ratio R (Fedorov et al., 2004), which is the ratio 
c Xl /&(xi\x 2 )- l n a sense, each birth zone can be thought 
of as an independent source of photon pairs. In this way, 
we can develop an intuitive understanding of the coher¬ 
ence properties of the two-photon fields in SPDC 23 . 

In particular, the birth zone width is useful in many 
calculations involving entangled photon pairs in SPDC. 
Consider the mutual information of the Double-Gaussian 
distribution (32); 

h{n : a) = log ) = log (l(w + 4)). 

(56) 

Also for the double-Gaussian state, the birth zone 
number N is related to the Schmidt number K , a measure 
of entanglement 24 for pure continuous-variable states 
(Law and Eberly, 2004) (see appendix for more details). 
For this state, K = |(IV + 1/iV), and the mutual infor¬ 
mation becomes: 

h{x\ : x 2 ) = log (A') « log(iV) - 1. (57) 

where the approximation applies for large N. 

In addition, the birth zone number gives us a perspec¬ 
tive in understanding the tradeoff between the first-order 
spatial coherence and the measurable biphoton correla¬ 
tions in the downconverted fields 25 . For completeness, 
we also briefly discuss the second order spatial coherence, 
as it is related to the biphoton correlations. As a bit of 
background, the first order coherence function, g^(a, b), 
is a normalized correlation between the electric field at 
one point (a) and the electric field at another point ( b ) 
(in space or in time). If the electric field is coherent be¬ 
tween two points (so that the phase difference between 
these two points is on average well-defined), then the co¬ 
herence function will have a magnitude near unity. The 
second-order coherence function, g^ 2 \a,b), is a normal¬ 
ized correlation between the intensity (i.e., square of the 
electric field) at one point with the intensity at another 
point. While g^(a,b) can be used to characterize the 
extent of interference effects in the signal/idler beams, 


measured full-width at half-maximum of the pump. Using the 
peak-matching approximation for the birth zone width, we find: 
AT _ FWHMp 

23 For an extensive reference on the relationship between first and 
second-order coherence in biphoton fields in SPDC, see (Saleh 
et al ., 2000). 

24 Given a pure continuous-variable density operator pab — 
\' 1 Pab)(' 1 I j Ab\ describing the joint state shared by parties A and 
B , the Schmidt number K is the reciprocal of the trace of the 
square of either reduced density operator; i.e., K = l/Tr[p^J = 
l/Tr[/3^]. The Schmidt number is also known as the inverse 
participation ratio. 

25 For a more thorough look into the first-order coherence properties 
of downconverted fields, see (Dixon et al ., 2010). 
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g( 2 \a,b) can be used to characterize the extent of sig¬ 
nal/idler photon correlations. 

To examine the first and second-order coherence func¬ 
tions in SPDC for a qualitative understanding, we look 
at the symmetric first order and second order spatial co¬ 
herence functions, (g^\x,—x) and g^(x, — x), respec¬ 
tively), as they have particularly simple forms in the 
Double-Gaussian approximation. The first-order sym¬ 
metric spatial coherence is defined as: 


the second order coherence width, A g^ 2 \ as the value of 
x where g ^ falls below unity, and the correlations can 
be treated as coming from a nonclassical source of light 
27 . With these definitions, we find: 


A g (1) = A p 


1 N 2 + 1 
(N 2 - l) 2 


and 


for large N A p 

¥ ~N = 


BZ 


(64) 


g (1) (x,-x) = 


(aj(x)ai(-x)) 


\J (a t i(a:)ai(x))(a t 1 (-x)ai(-x)) 


(58) 


A g^ 


Ap /lJV 2 + l, / N 2 + 1 \ 
N y 2 N 2 - 1 ° S V 2 N ) 


which in terms of our biphoton wavefunction ip(x i,X 2 ), 
can be expressed as: 


for large N 



(65) 


g (1 \x -x) = _ f dx 2 ip*(x,x 2 )i/j(-x,x 2 ) _ 

\f (f dx 2 \tp(x,x 2 )\ 2 ){J dx 2 \ip{-x, x 2 )| 2 ) 

(59) 

We note that the expectation values taken here are taken 
with the state of the downconverted field I'Fspdc)- 
Similarly, the second-order symmetric spatial coher¬ 
ence is defined as 

„( 2 ) fr _ r) = (d\(x)d\(—x)d 2 (—x)di(x)) 
(a[(x)d 1 (x))(dl(-x)d 2 (-x))’ 

which in terms of ip(xi,x 2 ), is expressed as: 

J2)( _ x = _ \^(x,-x)\ 2 _. 

(/ dx 2 \i/j(x,x 2 )\ 2 )(f dxi\%l>(xi, —x)\ 2 ) 

(61) 

Using the Double-Gaussian approximation of the 
biphoton wavefunction, these symmetric coherence func¬ 
tions take simple Gaussian forms. For ijj(xi,x 2 ) as de¬ 
fined in (50), we find: 

g (1) (x,-x) = e ( 62 ) 


and 


g (2) (x,-x) 


x 2 ( N 2 -1 \ 

N 2 + l £a pUU 2 + J 

-e 2 \2W) 

2 N 


(63) 


To find the range of values of x over which gU) anc [ 
g( 2 ^ are significant, we define the first-order coherence 
width, Ag(b, to be the value of x where g^ falls to 
1 /y/e. Note that this can be considered to be "a" in a 
Gaussian probability density 26 . In addition, we define 


We note that the errors in these approximations decrease 
monotonically, so that for N > 12.3, the error in our 
approximation to AgU) falls below 1%, and for N > 11.4, 
the error in our approximation to A g 1 ' 2 ' 1 also falls below 
1 %. 

Based on these calculations of the first and second- 
order coherence widths, we see that for typical sources 
of downconversion (where N ~ 100) the general area 
over which the downconverted light will exhibit signifi¬ 
cant first-order coherence is approximately the same as 
the area of a birth zone A| z . Because of this, down- 
converted light beams having a large degree of position 
or momentum correlation (i.e, a large N) can be con¬ 
sidered as a collection of many independent sources of 
photon pairs, each incoherent with one another (in the 
first-order sense). The second-order coherence width tells 
us a slightly different story since it only differs greatly 
from the first-order coherence width in the limit of zero 
correlation, or JV = 1. In this limit, we see that when 
the first-order coherence width is very large (implying the 
downconverted light can be described as a single coherent 
beam), the second order coherence width is necessarily 
small. However, the reverse is not true, since for large 
A, one can have small first and second order coherence 
widths. Since a small second order coherence width could 
imply either a large or small first-order coherence width, 
the tradeoff between single photon coherence and bipho¬ 
ton correlation is best understood by comparing A g^ 
with N or the mutual information h(x\ :x 2 ). 

The relationship between the first-order spatial coher¬ 
ence of the downconverted fields, and the biphoton corre¬ 
lations as measured by the mutual information is a man¬ 
ner of tradeoff. The mutual information of the position 


26 Since At/ 1 ) is the ”lcr” half-width of g^(x, —x), and gW(x, —x) 
gives the coherence of photons separated a distance 2x away 
from one another, photons separated by less than At/ 1 ) will be 
approximately coherent. 


27 Having a second order coherence t/ 2 ) below unity is a witness 
that the source of light must be nonclassical (i.e., not a coherent 
or thermal state, as may generated from a collection of indepen¬ 
dent atoms (Loudon, 2000)). 
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correlations (57) increases with N, while A g^> (64) de¬ 
creases with N. Thus, highly correlated downconverted 
fields can be treated as incoherent light, while highly co¬ 
herent (in the first-order sense) downconverted fields can 
be treated as an uncorrelated source of downconverted 
light (i.e., as a single beam at the downconverted fre¬ 
quency producing otherwise uncorrelated photon pairs). 
As one final point on the relationship between first and 
second-order coherence, there are cases where the vis¬ 
ibilities of first-order and second-order interference are 
very simply related to one another. In particular, it was 
shown in (Saleh et al., 2000) that for the two-slit ex¬ 
periment with downconverted biphotons, the first-order 
visibility V\ and the second-order visibility V 12 follow the 
relation: 


V ! 2 + V? 2 < 1. (66) 

This can be understood both in terms of a tradeoff be¬ 
tween signal-idler entanglement and single-photon coher¬ 
ence, as well as in terms of the monogamy of entangle¬ 
ment between the signal photon, idler photon, and a mea¬ 
surement device. Indeed, this tradeoff has been used suc¬ 
cessfully to experimentally estimate the Schmidt number 
(a measure of entanglement) of photon pairs in a beam 
of down-converted light (Di Lorenzo Pires et al., 2009). 


W911NF-10-1-0404, DARPA-DSO Grant No. W31P4Q- 
12-1-0015, and AFOSR Grant No. FA9550-13-1-0019. 


Appendix A: The double-Gaussian field propagated to 
different distances 

The Double-Gaussian field (30), when propagated to 
different distances z\ and z 2 has the same double- 
Gaussian form, though the coefficients are significantly 
more complex. 


p DG (x 1 ,x 2 ;z 1 ,z 2 ) 


Vac — b 2 \ 


- (ax\ -\- 2 bx 1 X2 +CX2) 


: a = 


+ <^x-)( z 2 + k l a l + a l-) 


P 


. b _ k l(°l + - cD(~i g 2 - k 2 p a 2 x+ a 2 x _) 


: c = 


k p {<*1+ + <*l_) (4 + k 2 a x+ °x -) 


V x+ v 


(Al) 


: d = kl{zi + z 2 2 )(al + + al_f + 2 k 2 pZl z 2 (a 2 x+ a 2 x _) 2 + 

+ 4z 2 z 2 2 +4ky x y x _. (A2) 


VII. CONCLUSION 

The usefulness of spontaneous parametric downconver- 
sion (SPDC) as a source of entangled photon pairs is 
historically self-evident (see footnote 1). In this discus¬ 
sion, we have looked at the fundamental principles gov¬ 
erning SPDC in such a way as is often used in continuous- 
variable quantum information experiments (namely, de¬ 
generate collinear Type-I SPDC). We paid particular at¬ 
tention to how one can predict the transverse-correlation 
width of photon pairs exiting a nonlinear crystal from 
first principles with accuracy matching current experi¬ 
mental data. Along the way, we digressed to explore 
how the double-Gaussian wavefunction used to describe 
SPDC allows a straightforward analysis of its measure¬ 
ment statistics even under free space propagation. In ad¬ 
dition, we have developed further the concept of a bipho¬ 
ton birth zone number, and have shown how it manifests 
itself in the duality between the correlations within one 
of the downconverted fields, and the correlations between 
the downconverted fields. It is our hope that this discus¬ 
sion will inspire further interest in the transverse spatial 
correlations of entangled photon pairs, and in the rela¬ 
tionship between intra- and inter-party coherence. 

We gratefully acknowledge helpful discussions with Dr. 
Gregory A. Howland, as well as the careful editing of 
Daniel Lum, Sam Knarr, and Justin Winkler. In ad¬ 
dition, we are thankful for the support from the Na¬ 
tional Research Council, DARPA-DSO InPho Grant No. 


However, we can still find some useful properties. For 
example, the Pearson correlation coefficient (see Table 1) 
has the simple expression: 


r = 



(A3) 


In more explicit terms, we get 


r = r 0 


1 - ZiZ 2 


V ( z i + 1 )( 2 ! + 1 ) 


2 2 
(7 — (7 t 

r ° = a 2 +a 2 ’ 

W X+ ' X_ 


where ro is the correlation coefficient at z\ = z 2 = 0, 
(see Table 1). In addition, the normalized propaga¬ 
tion distances Z\ and z 2 are defined such that Z\ = 
zi/(kp<j x+ a x _), and z 2 = z 2 /(k p a x+ a x _). Using this cor¬ 
relation function for the Double-Gaussian, (as mentioned 
previously), as we move from the near field (zi,z 2 ~ 0) to 
the far field (zi,z 2 0), the initially strong position cor¬ 
relations gradually weaken and eventually become strong 
position anti-correlations in the far field. In addition, we 
also see, that when comparing a measurement of the sig¬ 
nal photon in the near field to the idler photon in the far 
field, the correlations approach zero. Since position mea¬ 
surements in the far field can be taken as measurements 
of momentum (scaled accordingly), we see that the posi¬ 
tion of one photon is uncorrelated with the momentum 
of the other (and vise versa). 
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Appendix B: Schmidt decomposition and quantum 
entanglement of the Double-Gaussian state 


In order to measure the entanglement of a pair of par¬ 
ticles, one needs to know the complete density matrix de¬ 
scribing the pair of particles. To reckon with continuous- 
variable states described by continuous wavefunctions (or 
mixtures thereof), we need to decompose such wavefunc¬ 
tions into an orthogonal basis of eigenfunctions often 
corresponding to measurably distinct outcomes of some 
discrete observable. Occasionally, the density matrix in 
such a decomposition has a finite expression (with zero 
amplitudes for all other eigenfunctions in that infinite¬ 
dimensional basis), and the analysis is straightforward. 
Other times, the density matrix has non-trivial compo¬ 
nents over its entire spectrum, and an exact determina¬ 
tion is impossible (though approximations may suffice). 

However, when the pair of particles can be described 
by a pure state (i.e., a single joint wavefunction), it is 
sufficient to know just the eigenvalues of the reduced den¬ 
sity matrix of either particle. These eigenvalues manifest 
themselves in the Schmidt decomposition of an entangled 
pure state. 

As discussed in (Fedorov et al., 2009) and (Law and 
Eberly, 2004), the Double-Gaussian state can be decom¬ 
posed into Schmidt modes. 

OO 

ip DG (x i,x 2 ) = Y, VKu n (x 1 )u n (x2 ) (Bl) 

n—0 


Here, u n (x) is the n th “energy” eigenfunction of the quan¬ 
tum harmonic oscillator, and A„, in our notation, is: 


An — 4cJ a , + CJ a ,_ 


(<Tx + ~ Vx-) 2n 
( a x+ + < 7 X _ ) 2n + 2 


AN / iV — 1A 2 " 
(iV + 1) 2 ^iV + lJ 


(B2) 

The Schmidt number I\ is expressed as the reciprocal of 
the sum of the squares of the Schmidt eigenvalues: 


K = 




(B3) 


wi and u >2 (Mikhailova et al ., 2008a,b). In doing so, we 
find that the frequency (and temporal) correlations be¬ 
tween the signal/idler photon pairs differs significantly 
whether they come from Type-I or Type-II SPDC. In 
Type-II SPDC, the polarizations of the signal-idler pho¬ 
ton pairs are orthogonal to each other, and so experience 
a different index of refraction in birefringent nonlinear 
crystals. In this case, the biphoton wavefunction (not 
counting the pump profile) depends to first order on the 
difference between the signal and idler photon frequen¬ 
cies, resulting in a sine function of the frequency differ¬ 
ence, which translates to a top-hat function of the time 
difference. In this case, the temporal correlations can be 
characterized by the (full) width W( tl _ t2 ) of the top-hat 
function, giving us: 


(Type-II) 






(Cl) 


where n*)p is the group index of the signal photon at its 
central frequency ui p / 2, and c is the speed of light in vac¬ 
uum. This width amounts to the accumulated time lag 
between the signal and idler photons due to their experi¬ 
encing different indices of refraction. Using the research 
in (Pittman et al., 1996b), where they used a 0.5mm BBO 
crystal with a pump wavelength of 351.lnm, we find that 
Wr tl -t 2 ) ~ 125fs, which agrees within experimental un¬ 
certainty with their results. 

In Type-I SPDC, the signal-idler photon pairs have 
parallel polarizations, and so they experience the same 
index of refraction in the nonlinear crystal. Conse¬ 
quently, the biphoton wavefunction (not counting pump 
profile) to lowest order depends on the square of the 
signal-idler frequency difference. As a result, the tech¬ 
niques used to estimate the transverse spatial correlation 
width o'( Xl -x 2 ) can also be applied to estimating the tem¬ 
poral correlation width cr(t 1 -t 2 ), giving us: 


(Type-I) 


(exact) 


{PM) 

(tl — 


_ / 9L z ki 

10 5 


— . / 4L z k i 


(C2) 


Interestingly, since the Schmidt eigenvalues of the 
Double-Gaussian state are geometrically distributed (a 
maximum entropy distribution for constant N), the 
Double-Gaussian state is the maximally entangled state 
for a constant birth zone number. 

Appendix C: Heisenberg limited temporal correlations in 
SPDC 

If instead of taking the small-angle approximation in 
order to get the transverse biphoton wavefunction (26), 
we look directly at the longitudinal component of the 
wave vector mismatch A k z , we can express the biphoton 
state in terms of the signal and idler photon frequencies 


i2 i 

where \ is the group velocity dispersion con¬ 

stant at half the pump frequency. 

Because the signal and idler photons experience the 
same index of refection in Type-I SPDC, their tempo¬ 
ral correlation width can be much smaller than for pairs 
originating from a Type-II source. Indeed, for a 3mm 
BiBO crystal cut for Type-I SPDC, with a pump wave¬ 
length of 775nm, the temporal correlation width <7^^^ 
is predicted to be approximately 2.0fs, nearly two or¬ 
ders of magnitude below the width for a typical Type- 
II source. With a narrowband pump spectrum, the ra¬ 
tio between cr( tl+t2 ) and (J( tl _ t2 ) can be as large as 10 9 , 
a degree of temporal correlation outstripping any spa¬ 
tial correlations discussed thus far. However, even with 
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negligible jitter times and noise in photon counting ap¬ 
paratuses, these temporal correlations are not accessible 
unless the experimental setup used to measure these cor¬ 
relations accepts a relatively large range of frequencies; 
the degree of measurable temporal correlations is limited 
by the frequency-time Heisenberg relation (Mandel and 
Wolf, 1995). 

Using the conditional uncertainty relation(s) (39), we 
can study the tradeoff between the narrowness of the 
frequency spectrum of the signal or idler photon given by 
the standard deviation a u , 1 , and the temporal correlation 
width Using the same steps leading to (39), we 

find 

°Ui cr (ti-t 2 ) — 2’ (C3) 

This is understood because the standard deviation is 
invariant to constant shifts, and that conditioning on 
average never increases the variance. Thus, we know 
CT( tl _ t2 ) > er ((tl _ t2) | t2 ) = cr (tl | t2) . Consider that a typ¬ 
ical source of down-conversion has a frequency width a ull 
of the order 2 x 10 14 (radians per second) (Baek and Kim, 
2008). This implies that the smallest possible time cor¬ 
relation width <7( tl _ t2 ) for this source is of the order of 
4 femtoseconds. However, if we perform frequency filter¬ 
ing to look at the nearly degenerate part of the biphoton 
frequency spectrum, we increase the minimum resolvable 
time correlation width by a factor inversely related to the 
fraction of frequencies allowed to pass through the filter. 
If we consider a 2nm filter centered at 1550nm making 
a u i k, 7.8 x 10 11 (radians per second), then the minimum 
resolvable value of &( tl -t 2 ) would be about 6x 10 2 fs, more 
than two orders of magnitude wider that what is achiev¬ 
able without filtering. 
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